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T 



OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/981 , 900B 



DATE: 03/28/2002 
TIME: 16:29:17 



Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\03282002\I981900B.raw 

3 <110> APPLICANT: Stickler! , Masomeh B 

4 Maqbool, Shahina B 

5 Dale, Bruce E 

7 <120> TITLE OF INVENTION: TRANSGENIC PLANTS CONTAINING LIGNINASE AND CELLULASE WHICH 
DEGRADE LIGNIN 



8 




AND CELLULOSE TO FERMENTABLE SUGARS 








10 


<130> 


FILE 


REFERENCE: 


MSU 4.1-539 








12 


<140> 


CURRENT APPLICATION NUMBER 


: US/09/981, 900B 






C--> 13 


<141> 


CURRENT FILING DATE: 2002-03-18 








15 


<150> 


PRIOR APPLICATION NUMBER: 60/242,408 








16 


<151> 


PRIOR FILING DATE: 2000-10- 


-20 








18 


<160> 


NUMBER OF SEQ ID NOS : 22 










20 


<170> 


SOFTWARE: Patentin version 


3.1 








22 


<210> 


SEQ ID NO: 1 












23 


<211> 


LENGTH: 1110 












24 


<212> 


TYPE 


DNA 












25 


<213> 


ORGANISM: Oryza 


sativa 










27 


<400> 


SEQUENCE: 1 












28 


gggtcggaga 


tgccaccacg 


gccacaaccc 


acgagcccgg 


cgcgacacca 


ccgcgcgcgt 


60 


30 


tgagccagcc 


acaaacgccc 


gcggataggc 


gcgccgcacg 


cggccaatcc 


taccacatcc 


120 


32 


ccggcctccg 


cggctcgagc 


gccgtgccat 


ccgatccgct 


gagttttggc 


tatttatacg 


180 


34 


taccgcggga 


gcctgtgtgc 


agagagtgca 


tctcaagaag 


tactcgagca 


aagaaggaga 


240 


36 


gagcttggtg 


agctgcagag 


atggccccct 


ccgtgatggc 


gtcgtcggcc 


accaccgtcg 


300 


38 


ctcccttcca 


gggctcaagt 


ccaccgccgg 


catgccgtcg 


cccgccgtcc 


gaactccagc 


360 


40 


ttcggcaacg 


tcagcatggc 


ggcaggatca 


ggtgcatgca 


ggtaattacc 


tactgatcca 


420 


42 


acacacattc 


ttcttcttct 


tcttcttctt 


aaccaacatt 


aaccaacaac 


tcaattatcg 


480 


44 


tttattcatt 


gaggtgtggc 


cgattgaggg 


catcaagaag 


ttcgagaccc 


tctcctacct 


540 


46 


gccaccgctc 


accgtggagg 


acctcctgaa 


gcagatcgag 


tacctagctc 


cgttccaagt 


600 


48 


ggtgccctgc 


ctcgagttca 


gcaaggtcgg 


atttgtctac 


cgtgagaacc 


acaagtcccc 


660 


50 


tggatactac 


gacggcaggt 


actggaccat 


gtggaagctg 


cccatgttcg 


ggtgcaccga 


720 


52 


cgccacccag 


gtcgtcaagg 


agctcgagga 


ggccaagaag 


gcgtaccctg 


atgcattcgt 


780 


54 


ccgtatcatc 


ggcttcgaca 


acgttaggca 


ggtgcagctc 


atcagcttca 


tcgcctacaa 


840 


56 


cccgggctgc 


gaggagtctg 


gtggcaacta 


agccgtcatc 


gtcatatata 


gcctcgttta 


900 


58 


attgttcatc 


tctgattcga 


tgatgtctcc 


caccttgttt 


cgtgtgttcc 


cagtttgttt 


960 


60 


catcgtcttt 


tgattttacc 


ggccgtgctc 


tgcttttgtt 


ttttcttttc 


acctgattct 


1020 


62 


ctctctgact 


tgatgtaaga 


gtggtatctg 


ctacgactat 


atgttgtttg 


ggtgaggcat 


1080 


64 


atgtgaatga 


aatctatgaa 


agctccggct 








1110 


67 


<210> 


SEQ ID NO: 2 












68 


<211> 


LENGTH: 38 












69 


<212> 


TYPE 


PRT 












70 


<213> 


ORGANISM: Oryza 


sativa 










72 


<400> 


SEQUENCE: 2 












74 


Met Ala Pro Ser Val Met Ala Ser Ser Ala Thr 


Thr Val Ala Pro Phe 




75 


1 




5 




10 




15 
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78 
79 



Input Set : A:\PTO.VSK.txt: 

Output Set: N:\CRF3\03282002\I981900B.raw 

Gin Gly Ser Ser Pro Pro Pro Ala Cys Arg Arg Pro Pro Ser Glu Leu 
20 25 30 



82 Gin Leu Arg Gin Arg Gin 

83 35 

86 <210> SEQ ID NO: 3 

87 <211> LENGTH: 6 

88 <212> TYPE: PRT 

89 <213> ORGANISM: 
91 <220> FEATURE: 
92 
94 



Artificial Sequence 



<223> OTHER INFORMATION: Signal peptide targets the peroxisomes of 
<400> SEQUENCE: 3 



plants 



. (2512) 

E I beta-1, 4-endoglucanase precursor 



96 Arg Ala Val Ala Arg Leu 

97 1 5 

100 <210> SEQ ID NO: 4 

101 <211> LENGTH: 3004 

102 <212> TYPE: DNA 

103 <213> ORGANISM: Acidothermus cellulolyticus 

105 <220> FEATURE: 

106 <221> NAME/KEY: CDS 

107 <222> LOCATION: (824) 

108 <223> OTHER INFORMATION: 

111 <400> SEQUENCE: 4 

112 ggatccacgt tgtacaaggt cacctgtccg tcgttctggt agagcggcgg gatggtcacc 
114 cgcacgatct ctcctttgtt gatgtcgacg gtcacgtggt tacggtttgc ctcggccgcg 
116 attttcgcgc tcgggcttgc tccggctgtc gggttcggtt tggcgtggtg tgcggagcac 
118 gccgaggcga tcccaatgag ggcaagggca agagcggagc cgatggcacg tcgggtggcc 
120 gatggggtac gccgatgggg cgtggcgtcc ccgccgcgga cagaaccgga tgcggaatag 
122 gtcacggtgc gacatgttgc cgtaccgcgg acccggatga caagggtggg tgcgcgggtc 
124 gcctgtgagc tgccggctgg cgtctggatc atgggaacga tcccaccatt ccccgcaatc 
126 gacgcgatcg ggagcagggc ggcgcgagcc ggaccgtgtg gtcgagccgg acgattcgcc 
128 catacggtgc tgcaatgccc agcgccatgt tgtcaatccg ccaaatgcag caatgcacac 
130 atggacaggg attgtgactc tgagtaatga ttggattgcc ttcttgccgc ctacgcgtta 
132 cgcagagtag gcgactgtat gcggtaggtt ggcgctccag ccgtgggctg gacatgcctg 
134 ctgcgaactc ttgacacgtc tggttgaacg cgcaatactc ccaacaccga tgggatcgtt 
136 cccataagtt tccgtctcac aacagaatcg gtgcgccctc atgatcaacg tgaaaggagt 
138 acgggggaga acagacgggg gagaaaccaa cgggggattg gcg gtg ccg cgc gca 
139 
140 



Val Pro Arg Ala 
1 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
835 



142 


ttg 


egg 


cga 


gtg 


cct 


ggc 


teg 


egg 


gtg 


atg 


ctg 


egg 


gtc 


ggc 


gtc 


gtc 


883 


143 


Leu 


Arg 


Arg 


Val 


Pro 


Gly 


Ser 


Arg 


Val 


Met 


Leu 


Arg 


Val 


Gly 


Val 


Val 




144 


5 










10 










15 










20 




146 


gtc 


gcg 


gtg 


ctg 


gca 


ttg 


gtt 


gcc 


gca 


etc 


gcc 


aac 


eta 


gee 


gtg 


ccg 


931 


147 


Val 


Ala 


Val 


Leu 


Ala 


Leu 


Val 


Ala 


Ala 


Leu 


Ala 


Asn 


Leu 


Ala 


Val 


Pro 




148 










25 










30 










35 






150 


egg 


ccg 


get 


cgc 


gcc 


gcg 


ggc 


ggc 


ggc 


tat 


tgg 


cae 


aeg 


age 


ggc 


egg 


979 


151 


Arg 


Pro 


Ala 


Arg 


Ala 


Ala 


Gly 


Gly 


Gly 


Tyr 


Trp 


His 


Thr 


Ser 


Gly 


Arg 




152 








40 










45 










50 








154 


gag 


ate 


ctg 


gae 


gcg 


aac 


aac 


gtg 


ccg 


gta 


egg 


ate 


gee 


ggc 


ate 


aac 


1027 


155 


Glu 


He 


Leu 


Asp 


Ala 


Asn 


Asn 


Val 


Pro 


Val 


Arg 


He 


Ala 


Gly 


He 


Asn 
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1 =^fi 

X J o 
















60 










65 










158 




ttt 




ttc 


gaa 


acc 


tgc 


aat 


tac 


ate 
y 


y uy 


cac 


aat 

yy u 


etc 


taa 

uy y 


tea 


1075 




irp 




oxy 


flic 


m n 
ox u. 


± ii±. 


>-.y o 


A c n 


T VT 

1 yx 


Va 1 

V dX 


Va 1 

V dX 


ri X o 


c^ V 

VJ X y 


Leu 


T Tn 
X X 


Ser 




± D U 




70 










7 S 










RO 












162 


CCfC 


gac 


tac 


cgc 


age 


atg 


etc 


gac 


cag 


ata 


aag 


tea 


etc 


aac 


tac 


aac 


1123 


J- O J 


f\xy 




T vr 
xyx 


t\jL y 


OCX 


Met 


Leu 


Asp 


Gin 


He 


Lys 


Ser 


Leu 


Glv 

\jxy 


TvT 
xyx 


Asn 




1 fkA 
X D ft 


RS 










QO 










Q5 










100 




X u u 


3.Cd. 


ate 


per or 

^y y 


pI" cr 

^y 


eeg 


tac 


tet 


gac 


gac 


att 


etc 


aag 


ccg 


aap 
y y 


acc 


atg 


1171 


XD / 


T" Vi y 
1 IIX 


Tl 

X Xc 


rtx y 


T.on 


xrX (J 


lyr 


T' 

OCX 


A GO 


A c o 


X X c 


XiCLi. 


T.VQ 

xiy o 




o X y 


X liX 


i'lC L. 




X o o 










1 OS 
-L V/ J 










110 










115 






1 70 
X / u 


ov_y 


^ 3 o 
a. CIO 


y 


a 


a 3 1" 
uCL L. 


L. L> 


tap 


pa rr 


a t rr 
d L-y 


aat 


cag 


gac 


pt a 
^ t_y 


cag 


aat 
y y u 


ctg 


1219 


171 
X / X 


f L U 


& c n 


OfcJX 


Tl o 
X xt; 


& c n 


It lie 


lyr 


O J. 11 


MAt 
1*1 c u 


A Q n 
no 11 


\3 Xii 


A GO 


XlCLl 


O X 11 


c^ V 

o X y 


UCLL 




17 9 
X / z 








1 90 

X Z u 










1 9S 

X Z 










1 ^0 

X J u 








1 74 
X / *± 


cn-y 




uuy 


ppi cr 


y L.^- 


a t (T 
CI u,y 


gac 


aaa 


ate 


gtc 


apa 
y (^y 


tac 


gee 


aat 
y y u 


cag 


ate 


1267 


1 7 S 

X / J 


X liX 


OCX 


XjCU. 




V d X 








JL XC 


Va 1 

V d X 


Ala 


1 yx 


Ala 


o xy 


Gin 


He 




1 7 fi 

X / D 






1 ^ s 

X -5 J 










140 
J. *± u 










14 5 














V- L.y 


cgc 




att 


ett 


gac 


cgc 


cac 


cga 


eeg 


gat 


tap 
uy 


age 


aaa 
y y y 


cag 


1315 


1 7Q 
X / y 


oxy 


XitJU 


riX y 


X xt; 


X xc 


XiCU. 


A CO 


A T*rr 
y 


n J. o 


A T'rr 
rix y 


It X ^ 


A QO 


PVG 

^y o 


OCX 


vjx y 


Gin 








X J U 




















1 fiO 
X D U 












X o z 




gcg 


o L-y 


tgg 


■f" 3 r* 


a PIT 

ciL^y 


a (TP 

y 


a rrp 
ciy o 


rrt p 
y 


t prr 

uoy 


rra rr 
y dy 


rrp t 
y ^ L- 


a pa 
dv^y 


t aa 
uy y 


att 


tec 


1363 


1 ft 
X 0 J 


Sei 


HXa 


Leu 


Trp 


Tyr 


Thr 


Cot* 
OCX 


Got* 


Va 1 
V dx 


Q O 
OCX 


oXU 


A 1 a 

nXd 


T" Vi T" 
X iiX 


T T*o 

irp 


T 1 
X X c 


OCX 




1 RA 
X O ft 


1 fis 

X D D 










1 70 
X / u 










1 7 S 

X / J 










1 RO 
X o u 




1 Rfi 

-LOU 


y CH^ 


ol" rr 

c^ug 




y^y 


p1~ rr 


rrprr 
guy 


pa rr 


prrp 
oy V- 


tap 

U-d 


aag 


y y d 


aac 


ccg 


aeg 


gtc 


gtc 


1411 


1 ft7 
X o / 


A c 


Xiti^U, 


ni n 

uxii 


Al ;=! 


1 

XiCu. 


Ala 


m n 

Vj J. 11 


A T'rr 
£\i. y 


T VT* 

iyx 


i_jy o 


o X y 


A G n 
no 11 


Pto 


Thr 


Val 


Val 




XO O 




I 






IRS 
XO 3 










1 QO 
X y u 










1 Q S 

X J ^ 






1 QO 

JL J u 


y y U 


L. L. L. 


rra o 
y a 


L- t.y 


pap 


aac 


era cr 
y ay 


eeg 


cat 


gac 


eeg 


gee 


t ap 
(_y o 


t aa 
i_y y 


aap 
y y 


tap 
L-y 


1459 


1 Q 1 




File 


Asp 


Leu 


nxS 


Asn 


Pin 

^jXU 


Pro 


nXS 


Asp 


Pro 


A 1 i5 


v^y o 


T T'O 

irp 


oxy 


P\7 C 

o 




1 Q O 
X J Z 








900 

z u u 










90S 










910 

Z X u 








1 


ggc 


y d u 


Oi—y 


a.y 


a1- p 


rra p 
y ciL* 


t rrrr 

egg 


oy 01 


1 1 a 
uuy 


rrpp 
y oo 


rrpp 


rra rr 
y dy 


paa 
uy y 


gcc 


aaa 

y y ^ 


aac 


1507 




r* 1 17 


Asp 


Pro 


Ser 


Tl iii 

ixe 


Asp 


Trp 


Arg 


Leu 


A 1 ^1 
AXd 


A 1 j:^ 
AXd 


Pill 
oXU 


Arg 


A 1 i3 


P 1 T7 

\jxy 


Asn 




1 Qf^ 

xy D 






91 S 
ZXD 










9 9 0 
z z w 










9 9 s 
Z z 










1 Qft 
X J o 


y 






T_cg 


gtg 


aat 
act u 


p prr 
L-^^y 


a a p 

CLCIO 


pt rr 


p t P 


att 


tt P 

l_ L. 


rrt p 
y 


gaa 


arrt 
yy u 


at a 
y uy 


1555 


1 Q Q 

xy y 


a 1 a 


vax 


Leu 


Ser 


1 

vax 


Asn 


Pro 


Asn 


Leu 


Leu 


T 1 o 
X XC 


irllc 


Va 1 
V a. X 




oxy 


Va 1 

V dX 




o n o 
zU u 




o "3 n 










9 "5 =\ 
Z J D 










9 A 0 
Z ft U 












z u z 


ca.g 


age 


tac 


aac 


gga 


gac 


tec 




tgg 


tgg 


gge 


gge 


dd\^ 


r^'t" rr 


v_dd 


rrrr^ 

gga 


1 fiO ^ 

X D U J 


Z U J 


r" 1 n 


Sex 


Tyr 


Asn 


P 1 


Asp 


Ser 


Tyr 


Trp 


Trp 


P 1 \7 

oxy 


P 1 T7 

VjXy 


Asn 


Leu 


oXll 


ni V 
vjxy 




204 


24 D 










n c n 

z dO 










o 1=; 










o c n 
zbU 




206 


gcc 


gge 


eag 


tac 


eeg 


gtc 


gtg 


ctg 


aac 


gtg 


eeg 


aac 


cgc 


ctg 


gtg 


tac 


1651 


207 


Ala 


Gly 


Gin 


Tyr 


Pro 


Val 


Val 


Leu 


Asn 


Val 


Pro 


Asn 


Arg 


Leu 


Val 


Tyr 




208 










265 










270 










275 






210 


teg 


gcg 


cac 


gac 


tac 


gcg 


aeg 


age 


gtc 


tac 


eeg 


cag 


aeg 


tgg 


ttc 


age 


1699 


211 


Ser 


Ala 


His 


Asp 


Tyr 


Ala 


Thr 


Ser 


Val 


Tyr 


Pro 


Gin 


Thr 


Trp 


Phe 


Ser 




212 








280 










285 










290 








214 


gat 


eeg 


ace 


ttc 


cec 


aac 


aac 


atg 


cec 


gge 


ate 


tgg 


aac 


aag 


aac 


tgg 


1747 


215 


Asp 


Pro 


Thr 


Phe 


Pro 


Asn 


Asn 


Met 


Pro 


Gly 


He 


Trp 


Asn 


Lys 


Asn 


Trp 




216 






295 










300 










305 










218 


gga 


tac 


etc 


ttc 


aat 


cag 


aac 


att 


gca 


eeg 


gta 


tgg 


ctg 


gge 


gaa 


ttc 


1795 


219 


Gly 


Tyr 


Leu 


Phe 


Asn 


Gin 


Asn 


He 


Ala 


Pro 


Val 


Trp 


Leu 


Gly 


Glu 


Phe 




220 




310 










315 










320 
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222 


QQt 


acg 


aca 


etg 


caa 


tec 


acg 


ace 


gae 


cag 


acg 


tgg 


ctg 


aag 


aeg 


etc 


1843 


223 


Glv 


Thr 


Thr 


Leu 


Gin 


Ser 


Thr 


Thr 


Asp 


Gin 


Thr 


Trp 


Leu 


Lys 


Thr 


Leu ■ 




224 


325 










330 










335 










340 




226 


QtC 


caa 


tac 


eta 


egg 


ccg 


acc 


gcg 


caa 


tac 


ggt 


gcg 


gae 


age 


ttc 


eag 


1891 


227 


Val 


Gin 


Tvr 

J. J- 


Leu 


Arg 


Pro 


Thr 


Ala 


Gin 


Tyr 


Gly 


Ala 


Asp 


Ser 


Phe 


Gin 




228 










345 










350 










355 






230 


tQQ 


acc 


ttc 


tgg 


tec 


tgg 


aae 


ccc 


gat 


tec 




gae 


aca 


gga 




att 


1939 


231 


Tro 


Thr 


Phe 


Tro 

a. J- ^ 


Ser 


Trp 


Asn 


Pro 


Asp 


Ser 


Gly 


Asp 


Thr 


Gly 


Gly 


He 




232 








360 










365 










370 








234 


etc 


aacf 


crat 


aae 


tgg 


cag 


acg 


gtc 


gae 


aca 


gta 


aaa 


gae 


ggc 


tat 


etc 


1987 


235 


Leu 


Lys 


Asp 


Asp 


Tro 


Gin 


Thr 


Val 


Asp 


Thr 


Val 


Lvs 


Asp 


Gly 


Tyr 


Leu 




236 






375 










380 










385 










238 


QCCT 


cccr 


ate 


aaa 


teg 


teg 


att 


ttc 


gat 


cct 


gtc 




gcg 


tct 


gca 


teg 


2035 


239 


Ala 


Pro 


He 


Lys 


Ser 


Ser 


He 


Phe 


Asp 


Pro 


Val 


Gly 


Ala 


Ser 


Ala 


Ser 




240 




390 










395 










400 












242 


cct 


age 


acrt 


eaa 


cca 


tee 


cca 


tea 

-7 


ata 


teg 


ccg 


tct 


ccg 


teg 


ccg 


age 


2083 


243 


Pro 


Ser 


Ser 


Gin 


Pro 


Ser 


Pro 


Ser 


Val 


Ser 


Pro 


Ser 


Pro 


Ser 


Pro 


Ser 




244 


405 










410 










415 










420 




246 


ccg 


teg 


crccT 


aat 


caa 


aca 


cca 


aca 

-7 


cct 


act 


ccg 


acg 


ccg 


aca 


gee 


age 


2131 


247 


Pxo 


Ser 


Ala 


Ser 


Ara 


Thr 


Pro 


Thr 


Pro 


Thr 


Pro 


Thr 


Pro 


Thr 


Ala 


Ser 




248 










425 










430 










435 






250 


ccg 


acg 


cca 


acg 


ctg 


ace 


cct 


act 


act 


aca 

-7 


ccc 


acg 


ccc 


acg 


gca 


age 


2179 


251 


Pro 


Thr 


Pro 


Thr 


Leu 


Thr 


Pro 


Thr 


Ala 


Thr 


Pro 


Thr 


Pro 


Thr 


Ala 


Ser 




252 








440 










445 










450 








254 


cog 


aca 


cca 


tea 


cca 


acq 


gca 


gee 


tec 


gga 


gee 


cge 


tgc 


acc 


gcg 


agt 


2227 


255 


Pxo 


Thr 


Pro 


Ser 


Pro 


Thr 


Ala 


Ala 


Ser 


Glv 


Ala 


Ara 


Cys 


Thr 


Ala 


Ser 




256 






455 










460 










465 










258 


tac 


eag 


ate 


aae 


aac 


aat 

-7 ^ 


taa 


aac 


aat 


qqc 


ttc 


acg 


gta 


acg 


gtg 


gee 


2275 


259 


Tvr 


Gin 


Val 


Asn 


Ser 


Asp 


Tro 


Glv 


Asn 


Glv 


Phe 


Thr 


Val 


Thr 


Val 


Ala 




260 




470 










475 










480 












262 


ortcr 


aca 


aat 


tee 


aaa 


tec 


ate 


aca 


ace 


aaa 

-7 


aca 


tgg 


acg 


gtc 


agt 


tgg 


2323 


263 


Val 


Thr 


Asn 


Ser 


Glv 


Ser 


Val 


Ala 


Thr 


Lys 


Thr 


Trp 


Thr 


Val 


Ser 


Trp 




264 


485 










490 










495 










500 




266 


aca 


ttc 


gge 


gga 


aat 


eag 


acg 


att 


ace 


aat 


teg 


tgg 


aat 


gca 


gcg 


gtc 


2371 


267 


Thr 


Phe 


Gly 


Gly 


Asn 


Gin 


Thr 


He 


Thr 


Asn 


Ser 


Trp 


Asn 


Ala 


Ala 


Val 




268 










505 










510 










515 






270 


acg 


cag 


aae 


ggt 


cag 


teg 


gta 


acg 


get 


egg 


aat 


atg 


agt 


tat 


aac 


aae 


2419 


271 


Thr 


Gin 


Asn 


Gly 


Gin 


Ser 


Val 


Thr 


Ala 


Arg 


Asn 


Met 


Ser 


Tyr 


Asn 


Asn 




272 








520 










525 










530 








274 


gtg 


att 


cag 


cct 


ggt 


cag 


aac 


acc 


acg 


ttc 


gga 


ttc 


cag 


gcg 


age 


tat 


2467 


275 


Val 


He 


Gin 


Pro 


Gly 


Gin 


Asn 


Thr 


Thr 


Phe 


Gly 


Phe 


Gin 


Ala 


Ser 


Tyr 




276 






535 










540 










545 










278 


acc 


gga 


age 


aae 


gcg 


gca 


ccg 


aca 


gtc 


gee 


tgc 


gca 


gca 


agt 


taa 




2512 


279 


Thr 


Gly 


Ser 


Asn 


Ala 


Ala 


Pro 


Thr 


Val 


Ala 


Cys 


Ala 


Ala 


Ser 








280 




550 










555 










560 












282 


tacgtcgggg agccgacggg agggtccgga 


ccgtcggttc 


cccggcttce acetatggag 


2572 


284 


cgaacccaac aatccggacg gaactgcagg taccagagag 


gaacgaeacg aatgeecgee 


2632 


286 


atctcaaaac ggctgcgagc cggcgtcctc 


geeggggegg 


tgageatcge agectccate 


2692 


288 


gtgccgctgg cgatgcagca tcctgccatc 


gcegegacge 


acgtegacaa tecctatgeg 


2752 
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290 ggagcgacct tcttcgtcaa cccgtactgg gcgcaagaag tacagagcga acggcgaacc 

292 agaccaatgc cactctcgca gcgaaaatgc gcgtcgtttc cacatattcg acggccgtct 

294 ggatggaccg catcgctgcg atcaacggcg tcaacggcgg acccggcttg acgacatatc 

296 tggacgccgc cctctcccag cagcagggaa ccacccctga agtcattgag attgtcatct 

298 acgatctgcc gg 

301 <210> SEQ ID NO: 5 

302 <211> LENGTH: 562 

303 <212> TYPE: PRT 

304 <213> ORGANISM: Acidothermus cellulolyticus 
306 <400> SEQUENCE: 5 



308 


Val 


Pro 


Arg 


Ala 


Leu 


Arg 


Arg 


Val 


Pro 


Glv 


Ser 


Arg 


Val 


Met 


Leu 


Arg 


flQ 


1 

X 


















1 0 










15 




1 9 


V d J- 


m V 


Va 1 

V Cl x 


Va 1 

\ CLJl 


V Q. U. 


Ala 


Val 


Leu 


Ala 


Leu 


Val 


Ala 


Ala 


Leu 


Ala 


Asn 


o ±o 


















25 










30 






316 
~j J- \j 


Leu 


Ala 


Val 


Pro 


Arg 


Pro 


Ala 


Arg 


Ala 


Ala 


Gly Gly 


Glv 


Tvr 

X _y J. 


TrD 

X X p 


His 


31 7 






3S 




















45 








39(1 


T" Vi T* 


DfcJi 


v3 xy 


2i TTT 

t\s. y 


ox u. 


Tl A 
X -L tr 






Ala 


noli 


Asn 


Val 




Val 


Arg 


Tie 


3 91 
























60 










324 


Ala 


Glv 


lie 


Asn 


TrD 


Phe 


Glv 


Phe 


Glu 


Thr 


Cys 


Asn 


Tvr 


Val 


Val 


His 


325 


65 










70 










75 










80 


328 


Gly 


Leu 


Trp 


Ser 


Arg 


Asp 


Tyr 


Arg 


Ser 


Met 


Leu 


Asp 


Gin 


He 


Lys 


Ser 


329 










85 










90 










95 




332 


Leu 


Gly 


Tyr 


Asn 


Thr 


He 


Arg 


Leu 


Pro 


Tyr 


Ser 


Asp 


Asp 


He 


Leu 


Lys 


333 








100 










105 










110 






336 


Pro 


Gly 


Thr 


Met 


Pro 


Asn 


Ser 


He 


Asn 


Phe 


Tyr 


Gin 


Met 


Asn 


Gin 


Asp 


337 






115 










120 










125 








340 


Leu 


Gin 


Gly 


Leu 


Thr 


Ser 


Leu 


Gin 


Val 


Met 


Asp 


Lys 


He 


Val 


Ala 


Tyr 


341 




130 










135 










140 










344 


Ala 


Gly 


Gin 


He 


Gly 


Leu 


Arg 


He 


He 


Leu 


Asp 


Arg 


His 


Arg 


Pro 


Asp 


345 


145 










150 










155 










160 


348 


Cys 


Ser 


Gly 


Gin 


Ser 


Ala 


Leu 


Trp 


Tyr 


Thr 


Ser 


Ser 


Val 


Ser 


Glu 


Ala 


349 










165 










170 










175 




352 


Thr 


Trp 


lie 


Ser 


Asp 


Leu 


Gin 


Ala 


Leu 


Ala 


Gin 


Arg 


Tyr 


Lys 


Gly 


Asn 


353 








180 










185 










190 






356 


Pro 


Thr 


Val 


Val 


Gly 


Phe 


Asp 


Leu 


His 


Asn 


Glu 


Pro 


His 


Asp 


Pro 


Ala 


357 






195 










200 










205 








360 


Cys 


Trp 


Gly 


Cys 


Gly 


Asp 


Pro 


Ser 


He 


Asp 


Trp 


Arg 


Leu 


Ala 


Ala 


Glu 


361 




210 










215 










220 










364 


Arg 


Ala 


Gly 


Asn 


Ala 


Val 


Leu 


Ser 


Val 


Asn 


Pro 


Asn 


Leu 


Leu 


He 


Phe 


365 


225 










230 










235 










240 


368 


Val 


Glu 


Gly 


Val 


Gin 


Ser 


Tyr 


Asn 


Gly 


Asp 


Ser 


Tyr 


Trp 


Trp 


Gly 


Gly 



369 245 250 255 

372 Asn Leu Gin Gly Ala Gly Gin Tyr Pro Val Val Leu Asn- Val Pro Asn 

373 260 265 270 

376 Arg Leu Val Tyr Ser Ala His Asp Tyr Ala Thr Ser Val Tyr Pro Gin 

377 275 280 285 

380 Thr Trp Phe Ser Asp Pro Thr Phe Pro Asn Asn Met Pro Gly He Trp 

381 290 295 300 

384 Asn Lys Asn Trp Gly Tyr Leu Phe Asn Gin Asn He Ala Pro Val Trp 



2812 
2872 
2932 
2992 
3004 




Use of n and / or Xaa has been detected in the 
Sequence Listing. Review the Sequence Listing 
to ensure a corresponding explanation is present 
in the <220> to <223> fields of each sequence 
using n or Xaa. 
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L 


13 M:271 




Current Filing Date differs. 


Replaced 


L 


510 


M 


:341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# : 


6 


L 


511 


M 


:341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


IDi : 


6 


L 


518 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# : 


6 


L 


519 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# : 


6 


L. 


558 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# : 


6 


L 


559 




: 341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# : 


6 


L 


672 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


7 


L 


680 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


7 


L 


720 


M 


:341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


7 



file://C:\CRF3\Outhold\VsrI981900B.htm 



3/28/02 



